智能论文笔记

FaceQvec: Vector Quality Assessment for Face Biometrics based on ISO Compliance

Javier Hernandez-Ortega , Julian Fierrez , Luis F. Gomez , Aythami Morales , Jose Luis Gonzalez-de-Suso , Francisco Zamora-Martinez

分类：计算机视觉 | 机器学习

2021-11-03

在本文中，我们开发FaceQVEC，一种软件组件，用于估计ISO / IEC 19794-5中所考虑的每个要点的面部图像的符合性，这是一个质量标准，该标准定义了将它们可接受或不可接受的面部图像的一般质量指南用于官方文件，如护照或身份证。这种质量评估的工具可以有助于提高面部识别的准确性，并确定哪些因素影响给定的面部图像的质量，并采取行动消除或减少这些因素，例如，具有后处理技术或重新获取图像。 FaceQVEC由与上述标准中预期的不同点相关的25个单独测试的自动化，以及被认为与面部质量有关的图像的其他特征。我们首先包括在现实条件下捕获的开发数据集上评估的质量测试的结果。我们使用这些结果来调整每个测试的判定阈值。然后，我们再次在评估数据库中再次检查，该评估数据库包含在开发期间未见的新脸部图像。评估结果展示了个人测试的准确性，用于检查遵守ISO / IEC 19794-5。 Faceqvec可在线获取（https://github.com/uam-biometrics/faceqvec）。

translated by 谷歌翻译

Ithaca. A Tool for Integrating Fuzzy Logic in Unity

Alfonso Tejedor Moreno , Jose A. Piedra-Fernandez , Juan Jesus Ojeda-Castelo , Luis Iribarne

分类：人工智能

2023-01-01

Ithaca is a Fuzzy Logic (FL) plugin for developing artificial intelligence systems within the Unity game engine. Its goal is to provide an intuitive and natural way to build advanced artificial intelligence systems, making the implementation of such a system faster and more affordable. The software is made up by a C\# framework and an Application Programming Interface (API) for writing inference systems, as well as a set of tools for graphic development and debugging. Additionally, a Fuzzy Control Language (FCL) parser is provided in order to import systems previously defined using this standard.

translated by 谷歌翻译

S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations

Hriday Bavle , Jose Luis Sanchez-Lopez , Muhammad Shaheer , Javier Civera , Holger Voos

分类：机器人 | 人工智能

2022-12-22

In this paper, we present an evolved version of the Situational Graphs, which jointly models in a single optimizable factor graph, a SLAM graph, as a set of robot keyframes, containing its associated measurements and robot poses, and a 3D scene graph, as a high-level representation of the environment that encodes its different geometric elements with semantic attributes and the relational information between those elements. Our proposed S-Graphs+ is a novel four-layered factor graph that includes: (1) a keyframes layer with robot pose estimates, (2) a walls layer representing wall surfaces, (3) a rooms layer encompassing sets of wall planes, and (4) a floors layer gathering the rooms within a given floor level. The above graph is optimized in real-time to obtain a robust and accurate estimate of the robot's pose and its map, simultaneously constructing and leveraging the high-level information of the environment. To extract such high-level information, we present novel room and floor segmentation algorithms utilizing the mapped wall planes and free-space clusters. We tested S-Graphs+ on multiple datasets including, simulations of distinct indoor environments, on real datasets captured over several construction sites and office environments, and on a real public dataset of indoor office environments. S-Graphs+ outperforms relevant baselines in the majority of the datasets while extending the robot situational awareness by a four-layered scene model. Moreover, we make the algorithm available as a docker file.

translated by 谷歌翻译

Toroidal Coordinates: Decorrelating Circular Coordinates With Lattice Reduction

Luis Scoccola , Hitesh Gakhar , Johnathan Bush , Nikolas Schonsheck , Tatum Rask , Ling Zhou , Jose A. Perea

分类：机器学习 | (统计)机器学习

2022-12-14

The circular coordinates algorithm of de Silva, Morozov, and Vejdemo-Johansson takes as input a dataset together with a cohomology class representing a $1$-dimensional hole in the data; the output is a map from the data into the circle that captures this hole, and that is of minimum energy in a suitable sense. However, when applied to several cohomology classes, the output circle-valued maps can be "geometrically correlated" even if the chosen cohomology classes are linearly independent. It is shown in the original work that less correlated maps can be obtained with suitable integer linear combinations of the cohomology classes, with the linear combinations being chosen by inspection. In this paper, we identify a formal notion of geometric correlation between circle-valued maps which, in the Riemannian manifold case, corresponds to the Dirichlet form, a bilinear form derived from the Dirichlet energy. We describe a systematic procedure for constructing low energy torus-valued maps on data, starting from a set of linearly independent cohomology classes. We showcase our procedure with computational examples. Our main algorithm is based on the Lenstra--Lenstra--Lov\'asz algorithm from computational number theory.

translated by 谷歌翻译

A Review of Radio Frequency Based Localization for Aerial and Ground Robots with 5G Future Perspectives

Meisam Kabiri , Claudio Cimarelli , Hriday Bavle , Jose Luis Sanchez-Lopez , Holger Voos

分类：机器人

2022-12-09

Efficient localization plays a vital role in many modern applications of Unmanned Ground Vehicles (UGV) and Unmanned aerial vehicles (UAVs), which would contribute to improved control, safety, power economy, etc. The ubiquitous 5G NR (New Radio) cellular network will provide new opportunities for enhancing localization of UAVs and UGVs. In this paper, we review the radio frequency (RF) based approaches for localization. We review the RF features that can be utilized for localization and investigate the current methods suitable for Unmanned vehicles under two general categories: range-based and fingerprinting. The existing state-of-the-art literature on RF-based localization for both UAVs and UGVs is examined, and the envisioned 5G NR for localization enhancement, and the future research direction are explored.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Robot Localization using Situational Graphs and Building Architectural Plans

Muhammad Shaheer , Hriday Bavle , Jose Luis Sanchez-Lopez , Holger Voos

分类：机器人

2022-09-23

建筑行业的机器人可以使用高精度数据捕获来通过不断监视工作进度来降低成本。准确的数据捕获需要在环境中精确的移动机器人定位。在本文中，我们介绍了有关机器人本地化的新颖作品，该工作以墙壁和房间的形式提取了从建筑计划中提取几何，语义以及拓扑信息，并创建了情境图的拓扑和度量语言层（S-图）在环境中导航之前。当机器人在施工环境中导航时，它使用机器人的探光仪和从3D LIDAR测量中提取的平面壁的形式的感觉观测来估算其依靠粒子过滤器方法的姿势，并利用先前构建的情境图和它可用的几何，语义和拓扑信息。我们在将其与基于传统几何的本地化技术进行比较时，在实际持续的施工站点上捕获的模拟和真实数据集中验证了我们的方法。

translated by 谷歌翻译

TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media

Daniel Loureiro , Aminette D'Souza , Areej Nasser Muhajab , Isabella A. White , Gabriel Wong , Luis Espinosa Anke , Leonardo Neves , Francesco Barbieri , Jose Camacho-Collados

分类：自然语言处理

2022-09-15

语言随着时间的流逝而演变，单词含义会发生相应的变化。在社交媒体中尤其如此，因为它的动态性质会导致语义转移的速度更快，这使得NLP模型在处理新内容和趋势方面具有挑战性。但是，专门解决这些社交平台动态性质的数据集和模型的数量很少。为了弥合这一差距，我们提出了Tempowic，这是一种新的基准，尤其是旨在加快基于社交媒体的含义转变的研究。我们的结果表明，即使对于最近发行的专门从事社交媒体的语言模型，Tempowic是一个具有挑战性的基准。

translated by 谷歌翻译

A Lightweight Universal Gripper with Low Activation Force for Aerial Grasping

Paul Kremer , Hamed Rahimi Nohooji , Jose Luis Sanchez-Lopez , Holger Voos

分类：机器人

2022-08-23

软机器人抓手具有许多优势，可以解决动态空中抓握方面的挑战。最近展示的用于空中抓握的典型多指的软握把高度依赖于成功抓握的目标对象的方向。这项研究通过开发一种用于自主空气操纵的全向系统来推动动态空中抓地力的边界。特别是，该论文研究了一种新型，高度集成，模块化，传感器富含通用的握把的设计，制造和实验验证，专为空中应用而设计。提出的抓手利用粒子堵塞和软颗粒材料的最新发展产生了强大的握持力，同时非常轻巧，节能，并且只需要低激活力。我们表明，通过在膜的硅硅混合物中添加添加剂，可以将持有力提高多达50％。实验表明，即使没有几何互锁，我们的轻质抓地力也可以以低至2.5n的激活力发育高达15n的持有力。最后，通过将抓地力安装到多旋风的情况下，在实际条件下执行了一个选择和释放任务。开发的空中抓握系统具有许多有用的属性，例如对碰撞的弹性和鲁棒性以及将无人机与环境脱离的固有的被动合规性。

translated by 谷歌翻译

Where is VALDO? VAscular Lesions Detection and segmentatiOn challenge at MICCAI 2021

Carole H. Sudre , Kimberlin Van Wijnen , Florian Dubost , Hieab Adams , David Atkinson , Frederik Barkhof , Mahlet A. Birhanu , Esther E. Bron , Robin Camarasa , Nish Chaturvedi

分类：计算机视觉 | 人工智能

2022-08-15

脑小血管疾病的成像标记提供了有关脑部健康的宝贵信息，但是它们的手动评估既耗时又受到实质性内部和间际变异性的阻碍。自动化评级可能受益于生物医学研究以及临床评估，但是现有算法的诊断可靠性尚不清楚。在这里，我们介绍了\ textIt {血管病变检测和分割}（\ textit {v textit {where valdo？}）挑战，该挑战是在国际医学图像计算和计算机辅助干预措施（MICCAI）的卫星事件中运行的挑战（MICCAI） 2021.这一挑战旨在促进大脑小血管疾病的小而稀疏成像标记的自动检测和分割方法的开发，即周围空间扩大（EPVS）（任务1），脑微粒（任务2）和预先塑造的鞋类血管起源（任务3），同时利用弱和嘈杂的标签。总体而言，有12个团队参与了针对一个或多个任务的解决方案的挑战（任务1 -EPVS 4，任务2 -Microbleeds的9个，任务3 -lacunes的6个）。多方数据都用于培训和评估。结果表明，整个团队和跨任务的性能都有很大的差异，对于任务1- EPV和任务2-微型微型且对任务3 -lacunes尚无实际的结果，其结果尤其有望。它还强调了可能阻止个人级别使用的情况的性能不一致，同时仍证明在人群层面上有用。

translated by 谷歌翻译